A pr 2 01 2 BAYESIAN CENTROID ESTIMATION FOR MOTIF DISCOVERY
نویسنده
چکیده
Biological sequences may contain patterns that are signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We present a Bayesian model that is an extended version of the model adopted by the Gibbs motif sampler, and propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the maximum a posteriori estimator.
منابع مشابه
Bayesian Centroid Estimation for Motif Discovery
Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequenc...
متن کاملDevelopment of an Efficient Hybrid Method for Motif Discovery in DNA Sequences
This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...
متن کامل3,5,7-Trimethoxy-2-(4-methoxyphenyl)-4H-1-benzopyran-4-one
In the title compound, C(19)H(18)O(6), also known as 3,4',5,7-tetra-methoxy-flavone, the dihedral angle between the benzopyran-4-one group and the attached benzene ring is 11.23 (8)°. An intra-molecular C-H⋯O hydrogen bond generates an S(6) ring motif. In the crystal, mol-ecules are linked into a two-dimensional network parallel to (01) by inter-molecular C-H⋯O hydrogen bonds, which generate R(...
متن کاملCRLB Calculations for Joint AoA, AoD and Multipath Gain Estimation in Millimeter Wave Wireless Networks
In this report we present an analysis of the non-random and the Bayesian Cramer-Rao lower bound (CRLB) for the joint estimation of angle-of-arrival (AoA), angle-of-departure (AoD), and the multipath amplitudes, for the millimeter-wave (mmWave) wireless networks. Our analysis is applicable to multipath channels with Gaussian noise and independent path parameters. Numerical results based on unifo...
متن کاملThe MODIS software for word like motif discovery and its use for zero resource audio summarization
MODIS is a free audio motif discovery software developed at IRISA Rennes. Motif discovery is the task of discovering and collecting occurrences of repeating patterns in the absence of prior knowledge, or training material. In the case of speech, those motifs could be word since MODIS is tolerant to motif variability. The algorithm implementation allows to process large audio streams at a reason...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012